CounterPoint: Using Hardware Event Counters to Refute and Refine Microarchitectural Assumptions
Nick Lindsay (Yale), Caroline Trippel (Stanford), Anurag Khandelwal (Yale), Abhishek Bhattacharjee (Yale)
Finding Reusable Instructions via E-Graph Anti-Unification
Youwei Xiao (Peking University), Chenyun Yin (Peking University), Yitian Sun (Peking University), Yuyang Zou (Peking University), Yun Liang (Peking University)
Lifetime-Aware Design of Item-Level Intelligence
Shvetank Prakash (Harvard), Andrew Cheng (Harvard), Olof Kindgren (Qamcom), Ashiq Ahamed (PragmatIC), Graham Knight (PragmatIC), Jed Kufel (PragmatIC), Francisco Rodriguez (PragmatIC), Arya Tschand (Harvard), David Kong (Harvard), Mariam Elgamal (Harvard), Jerry Huang (Harvard), Emma Chen (Harvard), Gage Hills (Harvard), Richard Price (PragmatIC), Emre Ozer (PragmatIC), Vijay Janapa Reddi (Harvard)
PF-LLM: Large Language Model Hinted Hardware Prefetching
Ceyu Xu (HKUST), Xiangfeng Sun (HKUST), Weihang Li (Duke), Chen Bai (HKUST), Bangyan Wang (HKUST), Mengming Li (HKUST), Zhiyao Xie (HKUST), Yuan Xie (HKUST)
vCXLGen: Automated Synthesis and Verification of CXL Bridges for Heterogeneous Architectures
Anatole Lefort (TU Munich), Julian Pritzi (TU Munich), Nicolò Carpentieri (TU Munich), David Schall (TU Munich), Simon Dittrich (TU Munich), Soham Chakraborty (TU Delft), Nicolai Oswald (NVIDIA), Pramod Bhatotia (TU Munich)
Best Paper Honorable Mentions
RowArmor: Efficient and Comprehensive Protection Against DRAM Disturbance Attacks
Linear Layouts: Robust Code Generation of Efficient Tensor Computation Using F₂
RTeAAL Sim: Using Tensor Algebra to Represent and Accelerate RTL Simulation
MSCCL++: Rethinking GPU Communication Abstractions for AI Inference
PACT: A Criticality-First Design for Tiered Memory
M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization
RedFuser: An Automatic Operator Fusion Framework for Cascaded Reductions on AI Accelerators
A Data-Driven Dynamic Execution Orchestration Architecture
Highly Automated Verification of Security Properties for Unmodified System Software
Graphiti: Formally Verified Out-of-Order Execution in Dataflow Circuits
SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips
Influential Paper Awards
A Hardware Design Language for Timing-Sensitive Information-Flow Security
Danfeng Zhang, Yao Wang, G. Edward Suh, Andrew C. Myers
An Adaptive, Non-Uniform Cache Structure for Wire-Delay Dominated On-Chip Caches
Changkyu Kim, Doug Burger, Stephen W. Keckler
Overshadow: A Virtualization-Based Approach to Retrofitting Protection in Commodity Operating Systems
Xiaoxin Chen, Tal Garfinkel, E. Christopher Lewis, Pratap Subrahmanyam, Carl A. Waldspurger, Dan Boneh, Jeffrey Dwoskin, Dan R.K. Ports